National Repository of Grey Literature: 240 records found (records 1 - 10)
Integration of Voice Technologies on Mobile Platforms
Černičko, Sergij ; Černocký, Jan (referee) ; Schwarz, Petr (advisor)
The goal of the thesis is to become familiar with the methods and techniques used in speech processing, to describe the current state of research and development in speech technology, and to design and implement a server-based speech recognizer that uses BSAPI. A client that uses this server for speech recognition is then integrated into the mobile dictionaries of the Lingea company.
Multimedia support of the course BSIS
Pasečný, Jan ; Šebesta, Vladimír (referee) ; Sigmund, Milan (advisor)
This thesis aims to create a consistent set of study materials, supplemented with illustrative examples, for the Signals and Systems course. It starts with the basic characteristics of acoustic, image, biological and communication signals. The next part adds the characteristics of linear signals and A/D and D/A conversion, and the final part covers discrete signals. As a whole, the diploma thesis gives a basic theoretical description of the subject matter, supplemented with interesting examples, connections, graphs and MATLAB scripts that illustrate the topics discussed.
State of the art speech features used during the Parkinson disease diagnosis
Bílý, Ondřej ; Smékal, Zdeněk (referee) ; Mekyska, Jiří (advisor)
This work deals with the diagnosis of Parkinson's disease through analysis of the speech signal. It first describes speech signal production, followed by speech signal analysis, preprocessing and subsequent feature extraction. It then describes Parkinson's disease and how the disease changes the speech signal, together with the speech features used for its diagnosis (FCR, VSA, VOT, etc.). Another part of the work deals with selecting and reducing these features using learning algorithms (SVM, ANN, k-NN) and with their subsequent evaluation. The last part of the thesis describes a program for computing the features, then the feature selection, and finally evaluates all the results.
Modern coding of speech signals using overcomplete models
Zapletal, Ondřej ; Průša, Zdeněk (referee) ; Rajmic, Pavel (advisor)
The theoretical part of this thesis studies overcomplete models. These are signal models parametrized by more variables than necessary, for which a so-called sparse solution is then computed by iterative algorithms. The goal of this analysis is to select only the significant (sparse) parameters. The theory is based on linear algebra, vector spaces, bases and so-called frames. The practical part of the thesis describes and simulates two speech coders: a classical coder based on linear predictive speech coding, and a coder that makes use of overcomplete models of stochastic ARMA processes. Their realization includes simulating their decoders and analyzing their reconstruction quality. The implementation uses MATLAB and a library for overcomplete models (the frames toolbox).
Speech segmentation
Andrla, Petr ; Míča, Ivan (referee) ; Sysel, Petr (advisor)
A program for segmenting speech into phonemes was created as part of this master's thesis. The program was written in MATLAB and consists of several scripts. It performs automatic segmentation. Speech segmentation is the process of identifying the boundaries between phonemes in spoken natural languages. The automatic segmentation is based on vector quantization. In the first step of the algorithm, features are extracted. Speech segments are then assigned to the calculated centroids. A position where the assigned centroid changes is marked as a phoneme boundary. Audio recordings were processed by the program and the performance of the automatic segmentation was analysed. A detailed manual for the program was also written. The thesis briefly describes the individual speech processing methods used, their implementation in the program, and the reasons for the chosen parameter settings.
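The centroid-change rule described in this abstract can be sketched in a few lines. This is only a minimal illustration in Python, not the thesis's MATLAB scripts: the function names, the toy two-dimensional feature vectors and the fixed centroids are all assumptions made for the example, and a real system would obtain the centroids by training a vector quantizer on extracted speech features.

```python
from typing import List, Sequence


def nearest_centroid(frame: Sequence[float],
                     centroids: Sequence[Sequence[float]]) -> int:
    """Return the index of the centroid closest to the frame
    (squared Euclidean distance)."""
    distances = [
        sum((x - c) ** 2 for x, c in zip(frame, centroid))
        for centroid in centroids
    ]
    return distances.index(min(distances))


def find_boundaries(frames: Sequence[Sequence[float]],
                    centroids: Sequence[Sequence[float]]) -> List[int]:
    """Return the frame indices at which the assigned centroid changes."""
    labels = [nearest_centroid(frame, centroids) for frame in frames]
    return [i for i in range(1, len(labels)) if labels[i] != labels[i - 1]]


# Toy data (hypothetical): five feature frames and two centroids.
frames = [[0.1, 0.0], [0.2, 0.1], [0.9, 1.0], [1.1, 0.9], [0.0, 0.1]]
centroids = [[0.0, 0.0], [1.0, 1.0]]

print(find_boundaries(frames, centroids))  # prints [2, 4]
```

Each returned index is the first frame of a new segment, so multiplying it by the frame shift would give the boundary time in the recording.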
Comparison of Accuracy of Siri, Cortana and Google
Procingerová, Lucie ; Černocký, Jan (referee) ; Szőke, Igor (advisor)
The aim of this thesis is to compare the accuracy with which several services transcribe spoken word into text. It focuses primarily on applications from Apple Inc., Microsoft Corporation and Google Inc., but several others are also included, most of them available online. The document describes the problem and analyzes the procedure for each service. The test results are then analyzed and compared with the reference outputs. The conclusion discusses these experiments.
End-to-End Speech Recognition for Low-Resource Languages
Sokolovskii, Vladislav ; Schwarz, Petr (referee) ; Karafiát, Martin (advisor)
The field of automatic speech recognition has begun to adopt end-to-end neural network solutions for building speech recognizers. However, the data-hungry nature of such systems makes it possible to build recognizers only for high-resource languages such as English, Chinese or Spanish. In low-resource scenarios, solutions must be developed that mitigate the lack of data. One of the most effective techniques is fine-tuning a pretrained model. The problem with existing fine-tuning approaches is that the token sets of the target and source languages usually differ. This is why previous approaches to cross-lingual transfer learning required changing the output layer, mixing tokens from different languages in the output layer, using a universal token set, or using a separate output layer for each language. This is undesirable, because in that case sharing across languages is latent and uncontrollable in the output space when the language-specific graphemes are disjoint. This thesis therefore proposes mapping the tokens to a common set before pretraining begins. The existing solution is to transliterate the source language into the target language; the new approach is romanization, in which the token set of the target language is romanized to match the English alphabet. Diacritics can subsequently be restored from the romanized hypotheses using an additional restoration model. This has the advantage of increasing sharing in the output grapheme space.
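For Latin-script text, the idea of mapping a target-language token set onto the plain English alphabet can be illustrated by stripping diacritics with Python's standard `unicodedata` module. This is a simplified sketch under that assumption, not the pipeline used in the thesis: the function name `romanize` is hypothetical, non-Latin scripts would need real transliteration tables, and the restoration model mentioned in the abstract is not shown.

```python
import unicodedata


def romanize(text: str) -> str:
    """Strip combining diacritical marks so only base letters remain."""
    # NFD splits each accented letter into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # Drop the combining marks (Unicode category "Mn") and keep everything else.
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")


print(romanize("rozpoznávání řeči"))  # prints "rozpoznavani reci"
```

The mapping is lossy by design (for example, "č" and "c" collapse to the same token), which is why a separate step is needed to restore diacritics afterwards. Letters that have no Unicode decomposition, such as "ł" or "ø", pass through this sketch unchanged.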
Fast and Accurate Keyword Spotting System
Lenčéš, Marián ; Karafiát, Martin (referee) ; Schwarz, Petr (advisor)
This bachelor's thesis deals with fast and accurate detection of keywords in audio recordings. The aim was to study the possibilities of keyword detection and to create several types of language models, which were then compared with each other. The focus is on detecting keywords in recordings of spoken English.
Numerical simulation of human voice propagation through the vocal tract and in the space around the body
Batelka, Jiří ; Hájek, Petr (referee) ; Švancara, Pavel (advisor)
The first chapter of this master's thesis describes the source-filter theory of voice production, the anatomy of the larynx, possible approaches to modelling voice production, and selected works that use these approaches. A brief description of selected quantities used in acoustics and of model creation follows. Models of the head alone and of the head with a female and a male torso are created, including mesh testing to determine a suitable element size. The models created in this thesis focus on describing voice propagation, primarily in front of the body, and on the influence of the torso on sound propagation. Including the torso results in fluctuations in the frequency domain in the range from 1 000 Hz to 8 000 Hz, more pronounced towards the lower frequencies. In the transverse plane, the presence of the torso manifests itself as lower SPL in front of the mouth and higher SPL on the sides for several frequencies. The regions where SPL decreases in front of the mouth coincide with the frequencies at which SPL on the sides is evidently higher than in the direction in front of the mouth. These observations are in agreement with other works. No significant differences between the models with different torsos were observed in the transverse plane. Below the transverse plane, differences between the models with different torsos can be observed; for example, for some frequencies the directivity diagrams of the model with the male torso show no decrease in SPL in front of the mouth.
